feat(client): add Dynamo inference backend by biswapanda · Pull Request #2773 · PrimeIntellect-ai/prime-rl

biswapanda · 2026-06-11T17:04:07Z

Overview:

Adds NVIDIA Dynamo as an optional inference backend alongside the existing vLLM path. Controlled by a new ClientConfig.backend field ("vllm" | "dynamo"). Three self-contained changes: a pluggable AdminAPI abstraction, renderer_transport selection for the verifiers wire shape, and a Dynamo teacher-logprobs path for OPD training.

Details:

packages/prime-rl-configs/src/prime_rl/configs/shared.py

ClientConfig.backend: Literal["vllm", "dynamo"] — selects the AdminAPI implementation and verifiers wire shape. Default "vllm" is a no-op for existing configs.
ClientConfig.rl_base_url — optional override for the Dynamo RL worker discovery listener (GET /v1/rl/workers). When unset, the port is derived from DYN_RL_PORT (default 8001).

src/prime_rl/utils/client.py

AdminAPI Protocol + VLLMAdminAPI — extracts the existing vLLM admin paths (/pause, /resume, /update_weights, /load_lora_adapter, /init_broadcaster) into a typed protocol. VLLMAdminAPI methods go through a shared _admin_post helper that adds bounded per-attempt timeouts and tenacity retry on 5xx/transport errors (300 s for pause/resume, 720 s for weight updates).
DynamoAdminAPI — Dynamo worker admin over POST /engine/<method>: pause_generation, resume_generation, update_weights_from_disk / update_weights_from_distributed (filesystem vs NCCL paths), load_lora_adapter. Inherits health/model checks from VLLMAdminAPI.
setup_admin_api(client_config) — picks DynamoAdminAPI when backend="dynamo", VLLMAdminAPI otherwise.
discover_dynamo_admin_base_urls — resolves worker system URLs from GET /v1/rl/workers; falls back to port-replaced base_url when rl_base_url is unset.
setup_clients — sets renderer_transport="dynamo_chat" on all vf.ClientConfig objects when backend="dynamo", "vllm_generate" otherwise. Requires verifiers #1574 + renderers #79.

src/prime_rl/orchestrator/utils.py

Splits compute_teacher_logprobs into two paths dispatched on client_config.renderer_transport: _compute_teacher_logprobs_vllm (existing /inference/v1/generate path) and _compute_teacher_logprobs_dynamo (POST /v1/chat/completions with nvext.token_data + nvext.extra_fields=["prompt_logprobs"]).
_flatten_prompt_logprobs — shared flattener that handles both vLLM typed Logprob objects and Dynamos dict shape {logprob, rank?, decoded_token?}.

Where should the reviewer start?

src/prime_rl/utils/client.py — AdminAPI protocol (line ~32), DynamoAdminAPI class, setup_admin_api, and setup_clients renderer_transport selection. Core of the change.
src/prime_rl/orchestrator/utils.py — _compute_teacher_logprobs_dynamo and the compute_teacher_logprobs dispatcher. Note the placeholder messages field required by the Dynamo frontend even when nvext.token_data is set.
packages/prime-rl-configs/src/prime_rl/configs/shared.py — the two new ClientConfig fields; verify defaults are backward-compatible.

Related Issues:

Relates to verifiers #1574 — adds renderer_transport field to vf.ClientConfig
Relates to renderers #79 — adds dynamo_chat transport to renderers.generate()

Note

Medium Risk
Changes the weight-update and NCCL initialization paths when backend=dynamo, but default vllm behavior is preserved; misconfigured discovery or engine RPC could break training on Dynamo deployments.

Overview
Adds NVIDIA Dynamo as an optional inference backend via ClientConfig.backend ("vllm" | "dynamo", default unchanged) and optional rl_base_url for RL worker discovery.

Admin layer: Inference admin is refactored behind an AdminAPI protocol with VLLMAdminAPI (existing /pause, /update_weights, etc.) and DynamoAdminAPI (POST /engine/*, filesystem vs NCCL weight updates, LoRA via load_lora). Health, model checks, weight updates, LoRA load, and NCCL init all route through the selected implementation.

Dynamo wiring: When admin_base_url is unset, worker system URLs are discovered from GET /v1/rl/workers (port from rl_base_url or DYN_RL_PORT). Static pools retry discovery in wait_for_ready; elastic pools pin each pod’s admin client to the matching system_url by IP/DNS.

Rollouts & OPD: setup_clients sets renderer_transport to "dynamo" for the nvext wire shape. compute_teacher_logprobs dispatches to vLLM /inference/v1/generate or Dynamo chat completions with nvext.token_data. The orchestrator passes weight_broadcast.type into DynamoAdminAPI for NCCL vs disk updates.

Elastic: Separate model HTTP clients on the OpenAI-compat URL while admin hits the system server; backend is preserved when rebuilding train clients.

^{Reviewed by Cursor Bugbot for commit 3c41ee3. Bugbot is set up for automated code reviews on this repo. Configure here.}

…er transport

…s request

…mo admin to worker system URL

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit a31c60b. Configure here.}

biswapanda added 3 commits June 11, 2026 04:21

feat(client): add dynamo backend selector, DynamoAdminAPI, and render…

cea2fb9

…er transport

feat(orchestrator): compute teacher logprobs over dynamo nvext transport

5fa41bb

fix(orchestrator): send placeholder message in dynamo teacher-logprob…

1a25d08

…s request

biswapanda changed the title ~~Dynamo integration~~ feat(client): add Dynamo inference backend — AdminAPI, renderer transport, teacher logprobs Jun 11, 2026

cursor Bot reviewed Jun 11, 2026

View reviewed changes

Comment thread src/prime_rl/utils/client.py

Comment thread src/prime_rl/utils/client.py

Comment thread src/prime_rl/utils/client.py

Comment thread src/prime_rl/utils/client.py

fix(dynamo): wire NCCL weight-broadcast path and harden admin timeouts

52f8703

cursor Bot reviewed Jun 11, 2026

View reviewed changes

Comment thread src/prime_rl/utils/client.py

Comment thread src/prime_rl/utils/client.py

biswapanda mentioned this pull request Jun 11, 2026

feat: dynamo inference backend integration #2737

Open

1 task

biswapanda changed the title ~~feat(client): add Dynamo inference backend — AdminAPI, renderer transport, teacher logprobs~~ feat(client): add Dynamo inference backend Jun 11, 2026

fix(dynamo): pass env headers in admin discovery and pin elastic dyna…

a90b642

…mo admin to worker system URL

cursor Bot reviewed Jun 11, 2026

View reviewed changes

Comment thread src/prime_rl/utils/elastic.py

Comment thread src/prime_rl/utils/elastic.py

Comment thread src/prime_rl/utils/elastic.py Outdated

fix(dynamo): route elastic model checks to inference clients

a31c60b

cursor Bot reviewed Jun 12, 2026

View reviewed changes

Comment thread src/prime_rl/utils/client.py

biswapanda added 2 commits June 12, 2026 02:14

fix(dynamo): retry static admin discovery until ready

2c5f277

chore(client): rename renderer transport values

3c41ee3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(client): add Dynamo inference backend#2773

feat(client): add Dynamo inference backend#2773
biswapanda wants to merge 8 commits into
PrimeIntellect-ai:mainfrom
biswapanda:dynamo-integration

biswapanda commented Jun 11, 2026 •

edited by cursor Bot

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

biswapanda commented Jun 11, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview:

Details:

Where should the reviewer start?

Related Issues:

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

biswapanda commented Jun 11, 2026 •

edited by cursor Bot

Loading